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Abstract 


This  is  the  final  report  of  the  research  conducted  at  Stanford  Electronics  Laboratories 
under  the  sponsorship  of  the  Joint  Services  Electronics  Program  from  March  1 .  1 997 
through  March  31,  2001.  This  report  summarizes  the  areas  of  research,  identifies  the 
most  significant  results  and  lists  the  dissertations,  publications  and  presentations 
sponsored  by  the  contract  (DAAG55-97-1-01 15). 
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Introduction 

This  report  summarizes  the  activities  in  the  research  programs  at  the  Stanford 
Electronics  Laboratories  sponsored  by  the  Joint  Services  Electronics  Program  under  contract 
DAAH04-94-G-0058.  This  contract  is  monitored  by  the  Army  Research  Office,  Research 
Triangle  Park,  North  Carolina. 

This  report  covers  a  four  year  wind  down  period  after  50  years  of  the  JSEP  program  at 
Stanford.  This  period  of  research  supported  only  students  who  had  been  previously  supported  to 
enable  them  to  complete  their  PhD  degrees  without  switching  research  topics 


INVESTIGATION  OF  FABRICATION  AND  TRANSPORT  IN  QUANTUM  DOTS 

PRINCIPAL  INVESTIGATOR:  J.  S.  Harris,  Jr. 

GRADUATE  STUDENTS:  B.  Shimbo  and  D.  Stewart 

1.  Scientific  Objectives 

Present  semiconductor  devices  and  ICs  are  based  upon  classical  charge  transport  in 
semiconductors.  As  device  dimensions  continue  to  shrink,  they  are  rapidly  approaching  the  size 
scale  where  the  quantum  effects  of  charge  quantization,  electron-electron  interaction  and 
quantum  interference  dramatically  affect  the  device  behavior.  The  physical  rules  governing 
charge  transport  are  fundamentally  different  on  the  two  sides  of  this  size  boundary:  for  larger 
devices  the  familiar  classical,  collective  charge  models  presently  used  work  very  well;  for 
smaller  devices  classical  models  break  down  and  predict  inaccurate  or  totally  incorrect  behavior. 
Both  fabrication  techniques  and  an  understanding  of  the  quantum  charge  transport  will  be 
necessary  to  realize  new  device  and  IC  designs  in  this  nano-size  regime. 

We  have  conducted  an  investigation  into  the  quantum  charge  transport  of  individual  quantum 
dot  (QD)  devices.  Our  two  main  objectives  in  this  work  were:  1)  to  investigate  experimentally 
the  impact  of  electron-electron  interactions  on  charge  transport  through  a  quantum  dot  and  2)  to 
investigate  new  fabrication  techniques  that  might  be  applicable  to  realize  single  electron  devices. 

2.  Significant  Progress 

In  a  typical  SET  design,  the  central  island  is  defined  by  tunneling  barriers  to  the  source  and 
drain  leads.  Many  of  the  first  SETs  featured  vertical  tunneling  junctions  fabricated  using 
electron  beam  lithography  and  shadow  evaporation  techniques."  Although  thin  oxide  tunneling 
barriers  narrower  than  100  nm  are  readily  produced  by  vertical  junctions,  lateral  tunneling 
junctions  offer  certain  advantages.  The  possibility  of  reduced  capacitances  due  to  smaller  lateral 
junction  areas  should  lead  to  higher  possible  operating  temperatures.  Furthermore,  the  lateral 
configuration  lends  itself  to  easier  integration  with  other  planar  devices.  We  have  been 
investigating  two  methods  for  the  fabrication  of  lateral  metal-insulator-metal  tunneling  barriers, 
atomic  force  microscope  (AFM)  oxidation  and  current-induced  local  oxidation  (CILO). 

The  AFM  is  typically  used  to  image  surface  features  almost  atomic  resolution.  The  key 
element  of  the  AFM  is  an  atomically  sharp  tip  suspended  from  the  end  of  a  cantilever.  In  contact 
mode,  the  tip  rides  over  features  as  it  traverses  the  surface  in  a  raster  pattern.  A  feedback  circuit 
maintains  a  constant  cantilever  deflection  by  adjusting  the  height  of  the  sample  with  a 
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piezoelectric  stage.  Conductive  AFM  tips  can  be  made  by  fabricating  the  tips  out  of  doped  Si.  or 
by  coating  standard  Si  or  SiN  tips  with  metal  films. 

AFM  oxidation  is  accomplished  by  applying  an  electrical  bias  between  a  conductive  AFM  tip 
and  a  thin  film,  which  locally  anodizes  the  film,  using  surface  adsorbed  water  as  an  electrolyte.' 
Translating  the  tip  across  the  film  surface  while  applying  a  bias  produces  oxide  lines  with  widths 
ranging  from  tens  to  hundreds  of  nanometers. 

We  studied  the  AFM  oxidation  process  by  investigating  various  conductive  tips, 
semiconductor  and  metal  films,  and  oxidation  parameters.  Thin  films  of  Ti.  Ni.  and  Cr  were 
applied  to  the  AFM  tips  in  an  electron  beam  evaporator.  The  coated  tips  were  studied  for  wear 
and  film  adhesion  with  a  scanning  electron  microscope.  Commercially  available  conductive  Si 
and  Ptir-coated  tips  were  also  used  and  compared.'* 

A  number  of  materials  were  also  studied  as  candidates  for  substrate  material.  Thin  metal 
films  were  evaporated  or  sputtered  onto  100  nm  of  thermally  grown  SiO^.  Molecular  beam 
epitaxy  (MBE)-grown  films  of  GaAs  and  NiAl  were  also  used.  The  NiAl  film  seemed  to  have 
particular  promise  for  excellent  oxide  line  uniformity  due  to  its  single-crystal  structure  and  very 
flat  surface."'  Furthermore,  the  aluminum  oxide  produced  had  potentially  strong  physical  and 
electrical  traits.  Unfortunately,  the  NiAl  film  proved  difficult  to  oxidize  and  the  GaAs  substrate 
on  which  the  NiAl  was  grown  provided  poor  isolation.  The  investigation  then  focused  on 
electron-beam  evaporated  films  of  Ti  with  thicknesses  ranging  from  3  to  10  nm. 

We  studied  control  over  the  oxide  linewidth  by  varying  the  AFM  tip  translation  speed  and 
the  applied  voltage.  We  found  thar  the  linewidth  could  be  reduced  by  increasing  the  tip 
translation  speed  and  lowering  the  applied  voltage.  Oxidation  would  not  proceed,  however,  if 
the  speed  was  raised  or  the  voltage  dropped  beyond  a  certain  point.  Importantly,  the  width  of  the 
oxide  line  was  also  found  to  be  sensitive  to  the  ambient  humidity,  the  shape  of  the  AFM  tip,  and 
local  surface  conditions.  The  oxide  height  is  determined  by  the  thickness  of  the  film.  In  the  case 
of  Ti,  the  oxide  occupies  roughly  twice  the  volume  of  the  original  metal,  so  a  TiOx  line  that 
completely  cut  through  a  3  nm  Ti  film  would  extend  about  3  nm  above  the  surface. 

For  device  fabrication,  tlje  Ti  films  were  patterned  using  contact  photolithography  and  a  wet 
etching  process  in  HF  to  create  large-scale  two  or  three-terminal  structures  consisting  of  a  2  pm- 
wide  line  or  T-junction,  respectively.  We  used  the  AFM  to  define  sub-micron  features  and 
attempted  many  different  structures,  investigating  the  capabilities  of  both  AFM  oxidation  in 
general,  and  the  translation  capabilities  of  our  specific  AFM.  Hysteresis  and  drift  in  the 
piezoelectrics  controlling  the  motion  of  the  AFM  had  to  be  accounted  for  in  the  software  scripts 
that  directed  the  movement.  Broad  oxide  lines  drawn  at  low  speeds  and  high  biases  were  used  to 
provide  electrical  isolation  and  to  define  certain  device  features.  High  speeds  and  low  biases 
produced  fine  oxide  lines  for  use  as  lateral  tunneling  barriers.  Figure  1  is  an  AFM  image  of  a 
device  fabricated  with  AFM  oxidation. 

The  electrical  barrier  height  of  the  oxide  barriers  was  determined  by  taking  current-voltage 
measurements  in  a  cryostat  from  200  K  to  room  temperature.  Plotting  the  current  versus 
temperature  at  different  bias  points,  we  extract  activation  energies.  We  account  for  the  image 
force  barrier  lowering  by  plotting  the  activation  energies  versus  the  square  root  of  the  bias  and 
extrapolate  back  to  zero  volts.  Contact  difficulties  limited  the  number  of  devices  that  could  be 
tested.  A  barrier  height  of  0.126  eV  was  measured  for  the  device  shown  in  Figure  2. 
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Figure  1.  AFM  image  of  a  structure  made  with  AFM  oxidation.  The  dark  gray  is  Ti  and  the  light  gray  is 
TiOx  extending  about  3  nm  above  the  Ti  surface.  The  broad  oxide  lines  on  the  sides  were  drawn  first 
under  low  speed,  high  bias  conditions.  The  two  narrow  lines  in  the  center  were  drawn  at  higher  speed  and 
lower  bias. 


Ti73  [21]  Barrier  Height 
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(a.)  (b.) 

Figure  2.  a.)  AFM  image  of  single  barrier  made  with  AFM  oxidation,  b.)  Zero  bias  barrier  height  of 
0. 1 26  eV  measured. 


Challenges  with  the  reproducibility  of  the  AFM  oxidation  process  lead  us  to  consider 
alternative  methods  for  the  fabrication  of  lateral  tunneling  barriers.  Current-induced  local 
oxidation,  which  generates  oxides  at  narrow  constrictions  in  conductor  lines,  appeared  to  be  a 
promising  candidate.^  CILO  takes  advantage  of  the  local  high  current  density  produced  in  a 
constriction.  The  early  stages  of  electromigration  damage  enhance  diffusion  of  oxygen  into  the 
metal  film,  increasing  the  oxidation  rate  in  the  constriction.  Joule  heating  also  increases  oxygen 
diffusivity  and  the  oxidation  reaction  rate.  A  narrow  oxide  line  quickly  grows  from  the  film 
surface  down  to  the  insulating  substrate.  At  this  point,  the  current  falls  dramatically,  and  the 
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process  naturally  halts  itself.  A  plot  of  the  current  during  the  CILO  process  is  shown  in  figure  3. 
Initially,  the  current  changes  veiy'  little  as  the  resistance  is  dominated  by  the  leads.  As  the  oxide 
extends  deeper  and  deeper  into  the  metal  film,  the  contribution  of  the  constriction  to  the  total 
resistance  soon  begins  to  be  evident.  Finally,  the  metal  line  is  pinched  off  and  the  current  passes 
entirely  through  the  barrier. 


Ti73  [7]  CILO  Current 
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Figure  3.  Plot  of  the  current  through  the  CILO  process.  As  the  oxide  grows  into  the  metal  film, 
the  current  is  confined  to  a  narrower  and  narrower  portal.  After  the  oxidation  is  complete,  all 
remaining  current  passes  through  the  oxide  barrier. 

We  began  investigating  CILO  with  a  hybrid  AFM-CILO  process  in  which  we  used  the  same 
patterned  Ti  films  fabricated  for  AFM  oxidation.  We  used  AFM  oxidation  to  define  sharp  and 
narrow  constrictions  in  2  pm  wide  Ti  lines.  Applying  biases  up  to  20  V  across  these  lines 
produced  current  densities  greater  than  10’  A/cm%  and  oxidation  took  place  in  the  gap.  An  AFM 
image  of  the  CILO  oxide  and  a  plot  of  the  current-voltage  characteristic  before  and  after  CILO 
are  shown  in  figure  4.  The  barrier  height  of  this  CILO  oxide  was  also  determined  in  the  same 
way  as  the  AFM  oxides,  and  was  found  to  be  0. 1 19  eV. 

To  improve  the  reproducibility  of  the  constrictions  and  ease  their  fabrication,  we  began  to 
focus  on  using  electron  beam  lithography  and  lift-off  to  form  the  initial  constrictions,  instead  of 
AFM  oxidation.  We  developed  a  process  to  achieve  150  nm  resolution  using  a  diluted  ZEP520 
resist  followed  by  electron  beam  evaporation.  Lift-off  was  performed  in  resist  stripper  while  in 
an  ultrasonic  bath. 

The  CILO  process  remains  poorly  understood,  however.  We  seek  to  investigate  the  relative 
importance  of  the  electromigration  and  heating  enhancements  of  the  oxidation.  Since  grain 
boundary  diffusion  generally  dominates  in  thin  films,  we  will  investigate  the  influence  of  the 
metal  film  grain  size  on  CILO,  and  hence  the  role  of  the  diffusion  of  vacancies  along  grain 
boundaries.  Samples  from  different  evaporators  and  samples  that  have  undergone  different 
periods  of  annealing  will  have  different  grain  sizes  and  may  react  differently  to  the  CILO 
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process.  We  will  also  investigate  the  Joule  heating  to  determine  how  hot  the  temperature  in  the 
gap  is  relative  to  its  surroundings. 


(a.) 


Ti73  [5]  l-V  Before  and  After  CILO 
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Figure  4.  a.)  AFM  image  of  CILO  oxide  generated  in  gap  formed  by  AFM  oxidation,  b.)  Plot  of  the 
current- voltage  characteristic  before  and  after  CILO.  c.)  Zero  bias  barrier  height  of  0.1 19  eV  measured. 

The  improved  understanding  of  CILO  will  be  useful  for  its  application  to  single  electron 
device  fabrication.  Single  CILO  barriers  may  be  used  as  tunneling  barriers  in  SETs,  but  this 
would  require  very  narrow  oxide  lines.  We  propose  a  novel  use  that  has  less  stringent 
fabrication  requirements.  A  layer  of  Au  could  be  deposited  prior  to  the  evaporation  of  Ti.  When 
less  than  a  nanometer  of  Au  is  deposited,  the  Au  tends  to  clump  into  islands.  With  a  reasonably 
sparse  distribution,  a  small  number  of  islands  will  be  found  directly  in  the  constriction  formed  in 
the  Ti  line.  A  CILO  oxide  can  then  be  formed  to  isolate  the  Au  island.  In  this  device,  electrons 
tunnel  through  only  a  part  of  the  barrier,  and  a  single  line  acts  as  both  source  and  drain  tunnel 
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junctions.  This  eases  the  requirement  that  lines  must  be  made  as  thin  as  possible  to  al low- 
tunneling.  A  schematic  of  the  proposed  device  is  showm  in  figure  5. 


Figure  5.  Side  view  schematic  of  proposed  device  using  CILO  to  isolate  Au  islands.  The  linewidth 
requirement  is  eased  since  d^  is  now  the  tunneling  distance  instead  of  d,. 
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The  Micromachined  Tunneling  Transistor  (MTT) 

The  focus  of  this  work  is  the  simulation,  fabrication,  and  testing  of  a 
Micromachined  Tunneling  Transistor  (MTT).  The  MTT.  as  shown  in  Fig  1 .  consists  of  a 
micromachined  relay  that  is  used  to  bring  two  gold  surfaces  to  within  several  nanometers 
of  one  another.  When  the  two  surfaces  are  this  clo.se.  electrons  can  tunnel  from  one 
surface  to  the  other.  This  tunneling  current  can  be  modulated  by  slight  movements  of  the 
relay,  with  one  angstrom  of  relay  movement  corresponding  to  a  2X  change  in  tunneling 
current.  The  modulation  of  the  width  of  the  tunneling  barrier  is  potentiallv  much  more 
efficient  than  the  standard  MOSFET.  which  modulates  the  height  of  a  potential  barrier. 


Fig  1  Micromachined  Tunneling  Transistor 


The  recent  focus  of  the  MTT  project  has  been  on  fabrication  and  testing  of  the 
MTT.  Despite  the  design  of  particularly  robust  hinge  structures,  initial  fabrication 
attempts  invariably  resulted  in  structures  that  were  stuck  down  to  the  substrate.  This 
problem  was  solved  by  using  a  Critical  Point  Drier  (CPD)  for  the  final  release  step 
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We  have  tested  several  different  configurations  of  the  MTT,  Figure  2  nIiou  ^  iliu 
tips  of  a  lateral  unit  that  was  created  using  the  MIMPS/Cronos  fahi  icaiion  pisv  cNs 
These  lateral  tunneling  units  exhibit  are  actuated  using  comb  drive  structures.  The 
actuation  mechanics  of  a  comb  drive  structure  eliminate  the  pull-in  instabilit>  that  is  a 
problem  in  the  classic  relay  devices.  However,  the  lateral  unit  is  subject  to  out-of-plane 
forces,  and  these  have  kept  us  from  establishing  continuous  tunneling  contact  in  the 
MUMPS  devices.  Lack  of  funding  and  progress  on  other  fronts  made  a  .second  re\  ision 
of  the  design  impossible. 


'i 


Figure  2:  Lateral  Unit 

The  most  successful  MTT  configuration  has  been  the  dual  anchor  lex  er,  show  n  m 
figure  3.  The  dual  anchor  devices  are  more  robust  and  thus  surx  ive  ihe  proeesMug  in 
greater  numbers.  They  are  ahso  stiffer.  which  limits  their  gain  and  increases  actual  ion 
voltages. 
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Fig  3:  Dual  anchor 


(with  bonding  pad  in  foreground) 

The  dual  anchor  devices  exhibit  the  classic  iclax  problem  ot  pull-in  Thus  we 
implemented  several  different  methods  of  stabilization  for  these  de\’ices.  The  I'lrsi 
method  was  leveraged  bending,  where  the  length  of  the  gate  electrode  is  shmiened  until 
the  tip  of  the  lever  can  be  actuated  smoothly  over  the  full  range  of  motion  of  the  de\  isc 
Several  of  the  dual  anchor  devices  using  this  method  have  exhibited  tunneling  current, 
showing  that  they  have  full  range  actuation.  The  IV  curve  of  a  device  with  a  \  or>  small 
pull-in  height  (and  thus  nearly  stable  full-range)  is  show  in  figure  4.  The  curve  show  s 
only  a  4- volt  hysteretic  loop,  an  order  of  magnitude  lower  than  that  predicted  for  a  full- 
gate  device  of  the  same  dimensions.  The.se  devices  are  still  under  testing. 
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Device  Current  (microamps) 
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-6 

Actuation  Voltage  (V) 


The  second  method  of  stabijization  implemented  for  the  dual  anchor  dc\  iccs  a 
raised  drain.  In  this  method,  the  drain  of  the  device  is  raised  far  enough  so  that  ii  cnici  ^ 
the  range  of  stable  motion  of  the  device.  A  scanning  electron  microscope  (SEM )  picture 
of  such  a  device  is  shown  in  Fig  5.  These  device  have  initial  source-drain  gaps  of  as  little 
as  0.1  microns,  and  have  been  actuated  several  thousand  times.  We  are  currentK 
working  on  a  new  revisipn  of  electronics  that  will  allow  these  devices  to  tunnel  w  ithout 
actuating  them  to  failure. 
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:  Raised  Drain 
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Electrical  Properties  of  Sub  100  nm  Silicon  Structures 


Semiconductor  device  dimensions  are  rapidl>  shrinking  into  the  tar  suh-nneroi' 
regime.  As  dimensions  reach  the  order  of  100  nm.  the  low  frcquenc\  noise,  uhieh  is 
generated  by  the  interaction  of  channel  carriers  with  nearby  traps  in  the  gate  oxide,  ean 
no  longer  be  described  by  the  characteristic  1/f  spectrum. [Hung.  Jaxaramanl  .Xi  tins 
scale,  the  effects  of  individual  traps  are  visible  as  random  telegraph  signals  in  the  room 
temperature  channel  conductance. [Kirton]  Devices  with  a  single  trap  active  in  the 
measurement  range  have  been  extensively  studied  in  the  literature,  but  the  possibilitx  of  a 
“zero-trap”  device,  one  with  no  low  frequency  noise  due  to  traps,  has  receixed  mmimal 
attention.  This  research  investigates  the  low  frequency  noise  sources  underix mg  ilic  trap 
noise  through  the  study  of  two  types  of  surrounding-gate  transistors.  The  first,  a 
cylindrical  surrounding-gate  transistor,  is  u.sed  to  as.sess  the  possibilitx  of  a  "/cro-trap" 
device  because  its  small  channel  area.  0. 1  um  length  by  0. 16  urn  xx  idth.  i^  prcdicicU  lo 
have  an  average  of  0.5  active  traps  per  device.  The  second  dex  icc.  a  ilcplciion-m.>dc  i  a 
surrounding-gate  transistor,  is  u.sed  to  drive  the  current  axvax  from  the  oxide  ti  .ip^  w  !iij. 
the  noise  spectrum  is  measured.  Fabrication  and  preliminary  electrical  chai  acton /.at  ion 
of  both  devices  has  been  completed. 


The  cylindrical  surrounding-gate  transistor  is  a  vertical  transistor  built  from  a 
silicon  nanopillar.  Nanopillars  are  fabricated  by  using  electron  beam  lithographx .  plasma 
etching,  and  self-limiting  oxidation  to  form  a  p-type  silicon  core  7.X0  nm  tall  and  less  titan 
50  nm  wide.  Gate  oxidation  and  gate  polysilicon  deposition  folloxw  The  surrounding- 
gate  is  formed  with  a  spacer  etch,  and  the  source  and  drain  at  the  bottom  and  top  of  the 
pillar  are  doped  with'  ah  angled  implant.  The  whole  structure  is  passivated  with  deposited 
oxide  and  vias  are  etched  to  the  source  and  gate  and  filled  with  deposited  metal. 

Electrical  contact  to  the  top  of  the  pillar  is  accomplished  by  the  use  of  a  focused  ixm  beam 
to  first  etch  a  hole  to  the  top  of  the  pillar  and  then  deposit  a  tungsten  plug.  An  .Sf-.M 
image  of  the  cross  section  of  the  device  following  passivation  is  show  n  in  F-igurc  I  a.  An 
FIB  image  of  the  cross  section  of  the  fini.shed  device  showing  etched  top  via  and  uingsien 
plug  is  show  in  Figure  lb. 
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Figure  1:  a)  SEM  image  of  cylindrical  surrounding-gate  transistor  cross  section  show  mg 
50  nm  wide  silicon  nanopillar  with  surrounding  gate.  Source  and  drain  arc  doped  ai  loj' 
and  bottom  of  pillar,  b)  FIB  image  of  cylindrical  surrounding-gaic  transisun  cross 
section  showing  FIB  etched  top  via  filled  with  tungsten  plug. 


The  DC  electrical  characteristics  of  a  single  cylindrical  surrounding-gaie  iraiisisioi 
show  gate  modulation  of  the  drain  current.  The  device  is  biased  aiul  nieasm  ed  usme  me 
source/monitor  units  of  an  HP4145B  semiconductor  parameter  anal> /er,  I'lie  suhMi.iie 
and  source  are  grounded  while  the  drain  voltage  is  swept  and  the  gate  \oliage  is  stepped 
An  Id-Vd  sweep  for  a  single  device  is  shown  in  Figure  2. 
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Figure  2:  Drain  current  versus  drain  voltage  at  various  gate  \oltage  steps  loi  a  e\  lindi  lea 
surrounding-gate  transistor. 


Noise  measurements  will  be  taken  in  the  time  domain  by  amplifying  the 
fluctuations  in  the  drain  current  at  fixed  gate  and  drain  bias  and  measuring  them  on  an 
oscilloscope.  For  a  single  active  trap,  these  fluctuations  will  take  the  form  of  a  random 
telegraph  signal  with  the  trap  switching  between  a  filled  and  empty  state.  As  the  de\  ice 
is  biased  away  from  .the  trap  such  that  the  trap  is  no  longer  active,  the  noise  due  to 
underlying  sources  will  become  visible  as  the  random  telegraph  signal  amplitude  falls 
below  that  of  the  underlying  noise. 


The  depletion-mode  fin  surrounding-gate  transistor,  rather  than  being  c>  liiuli  !>.  all\ 
.symmetric,  is  elongated  in  one  lateral  direction.  Aside  from  the  elongateil  dimciisii'n  .in 
the  channel  doping,  this  device  shares  the  same  fabrication  steps  w  nh  the  c\  hiuli  ical 
surrounding-gate  transi.stor.  A  diagram  of  the  depletion-mode  fm  siiirounding  g.ae 
transistor  is  shown  in  Figure  3a  and  an  SEM  image  ol  a  gated  ilcpleiion-iiuule  Im 
surrounding-gate  transistor  is  shown  in  Figure  3b. 
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Figure  3:  a)  Diagram  of  depletion-inoilo  I’m  Nuriinindmg  g, iic  ii.idm';.':  ■'  ''i  \! 

in  oblique  view  of  gated  depletion-mode  fin  surrounding-gate  transistor  K'loiv 
passivation. 


The  low  frequency  (1/f)  noise  spectrum  of  depletion-mtKic  I'm  surnnmdmg  gak 
transistors  will  be  measured  as  the  gate  bias  forces  electrons  awa\  from  the  mieriaee. 
Noise  remaining  when  electrons  no  longer  interact  with  the  surface  w  ill  be  due  to  the 
non-surface  low  frequency  noise  sources  to  be  studied.  While  no  studies  of  do\  ices  w  nh 
this  geometry  and  size  have  been  reported  in  the  literature,  a  stud\  b\  Amberiadm  and 
van  derZiel  shows  that  as  the  conducting  region  in  large  area  .VIO.S  eapaeiioiN  i'^  ioived 
away  from  the  silicon/silicon  dioxide  interface,  the  magnitude  of  the  l/l  noise 
decreases.  [Amberiadi  s] 
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FIBER-BASED  OPICAL  FREQUENCY  CONVERSION: 

A  NOVEL  DEVICE  AND  WDM  NETWORK  APPLICATIONS 

PRINCIPAL  INVESTIGATOR:  Leonid.  G.  Kazovsky 
RESEARCH  STUDENT:  Min-Chen  Ho 


1  INTRODUCTION 

Wavelength  converters  are  important  building  blocks  in  WDM  networks.  They  allow 
non-blocking  wavelength  routing  and  de-central  ized  wavelength  management  [Yoo] 
[Willner].  Although  wavelength  converters  based  on  O/E/0  techniques  provide  good 
signal  quality,  all-optical  wavelength  converters  have  the  advantages  of  modulation  format 
and  bit-rate  transparency,  and  can  provide  multi-wavelength  conversion  simultaneously. 
Several  all-optical  wavelength  conversion  techniques  have  been  proposed  and 
demonstrated  [Stubkjaer]  [Brener]  [Watanabe]  [Aso].  Fiber-based  optical  wavelength 
converters  (FWCs)  utilizing  the  parametric  amplification  in  fibers  are  also  all-optical 
wavelength  converters.  Besides  "the  common  benefits  that  all-optical  wavelength 
conversion  techniques  can  provide,  FWCs  can  potentially  have  high  gain  and  wide 
bandwidth  [Marhic-a].  Recent  research  efforts  in  WDM  system  show  interest  in 
wavelength  regions  outside  the  conventional  EDFA  band,  i.e.,  L-band  [Sun]  and  S-band 
[Kani],  in  order  to  increase  transmission  capacity  in  optical  fiber.  FWCs'  potential  wide 
bandwidth  would  help  meet  the  future  need  of  WDM  systems. 

During  the  first  year  of  our  project  (May  1997  -  December  3U*,  1997),  we  have 
performed  theoretical  and  experimental  studies  of  one-pump  and  two-pump  FWC. 
During  the  second  year  (Jan  U*,  1998  -  December  3U*,  1998),  the  main  focus  of  our 
research  was  to  develop  techniques  to  further  increase  the  operating  bandwidth  of  FWC. 
During  the  third  year  (Jan  U',  1999  -  December  3L’.  1999),  we  have  continued  the  effort 
in  developing  wideband  FWCs  and  studied  the  interaction  of  FWCs  with  other  nonlinear 
phenomena  in  fibers,  i.e.,  stimulated  Raman  scattering.  We  spent  the  remaining  period  of 
this  project  (Jan  L‘,  2000  -  March  3U',  2001)  developing  FWCs  for  practical  system 
application.  We  proposed  and  demonstrated  a  method  using  two  pumps  for  suppressing 
idler  linewidth  broadening. 

Section  2  summarizes  the  research  achievement  supported  by  this  project.  This 
project  directly  supported  six  conference  publications,  four  journal  publications,  and  one 


Ph.  D.  dissertation,  which  will  be  published  in  June  2001.  Sections  3  and  4  list  the  titles 
of  the  publications  and  the  dissertation. 

2  SUMMARY  OF  RESEARCH  RESULTS 

Highly  nonlinear  dispersion-shifted  fiber  (HNLF)  is  a  key  component  in  this  project. 
Its  high  nonlinearity  and  flat  dispersion  slope  help  to  increase  the  conversion  efficiency, 
and  reduce  the  fiber  length  required  to  achie\'e  a  certain  wavelength  conversion  efficiency. 
Section  2.1  introduces  HNLFs  and  describes  an  FWC  constructed  with  an  HNLF. 

Although  HNLFs  have  the  desired  features  for  making  FWCs.  their  zero-dispersion 
wavelengths  are  not  easy  to  control  during  manufacturing  due  to  the  small  core  size.  We 
have  proposed  and  demonstrated  a  periodic  compensation  technique  to  overcome  this 
problem.  Section  2.2  summarizes  the  results  of  this  technique. 

Theory'  shows  that  the  bandwidth  of  FWCs  using  HNLF  could  be  over  200  nm. 
Under  these  circumstances  the  FWG  bandwidth  w'ould  overlap  the  pump-induced  Raman 
gain  bandwidth.  We  have  studied  the  combined  effects  of  FWC  and  Raman  gain 
theoretically  and  experimentally.  Section  2.3  shows  experimental  results  for  a  200  nm- 
bandwidth  FWC.  and  summarizes  the  influence  of  the  Raman  effect  encountered  in  our 
studies. 

The  performance,. of  cw-FWCs  has  been  hampered  by  pump-induced  converted- 
signal  spectrum  broadening,  due  to  the  required  pump  phase  modulation.  We  theoretically 
investigate  and  experimentally  demonstrate  a  technique  to  cancel  the  idler  (converted- 
signal)  broadening  by  using  two  pumps  phase-modulated  180  degrees  out  of  phase. 
Section  2.4  states  the  problem  and  summarizes  the  results  of  our  cancellation  technique. 

2.1.  Demonstration  of  FWC  in  Highly  Nonlinear  Dispersion-Shifted  Fiber 

In  earlier  experiments.  FWCs  were  demonstrated  in  conventional  dispersion-shifted 
fibers  (DSFs).  DSFs  are  designed  for  transmission  rather  than  for  FWC  applications,  and 
they  have  the  following  disadvantages  for  FWC  applications:  (1)  a  relatively  low 
nonlinearity  coefficient,  7  =  2km  'W  ^2)  a  fairly  large  dispersion  slope, 

=0.07  ps?nm  "km  ^  result  of  (1).  for  a  given  pump  power  ^  the  fiber  length 

^  required  for  gain  >  10  dB  is  large,  ranging  from  200  m  for  a  pulsed  pump,  to  12  km  for 
a  cw  pump.  And  because  of  (1)  and  (2).  appreciable  bandwidth  (10s  of  nm)  can  be 


achieved  only  if  the  (pulsed)  pump  wavelength  is  within  about  1  nm  of  the  zero- 
dispersion  wavelength  • 

Recently,  several  fiber  manufacturers  started  to  develop  highly-nonlinear  dispersion- 
shifted  fibers  (HNLF).  This  type  of  fiber  is  designed  to  enhance  nonlinear  effects  in 
fibers.  The  enhancements  appear  in  two  ways.  First,  the  nonlinear  coefficient  is  increased 
because  of  larger  (nonlinear  refractive  index)  and  smaller  effective  area.  Higher 

nonlinearity  reduces  the  power  or  fiber  length  required  to  achieve  the  same  gain.  Second, 
the  dispersion  slope  is  reduced,  resulting  in  a  wider  bandwidth  [Marhic-a]. 

We  are  the  first  to  demonstrate  an  FWC  using  HNLF.  Figure  1  shows  the 
experimental  result.  Further  information  can  be  found  in  [Ho-a]. 


Figure  1 :  Experimental  results  for  an  FWC  using  HNLF. 

2.2.  FWC  Bandwidth  Expansion  Using  a  Dispersion  Compensation  Technique 

Although  HNLFs  have  the  desirable  features  for  FWC  applications,  their  zero 
dispersion  wavelengths  (“^o  s)  are  not  easy  to  control  during  manufacturing.  For  example, 

the  HNLF  presented  in  the  first  demonstration  of  FWC  with  HNLF  (Section  2.1)  has 
/Iq  =  1591  nm  jg  f^j.  wavelength  region  where  high  power  pumping  sources 

are  currently  available,  i.e.,  1530-1565  nm.  With  pump  wavelength  =  ^542  nm  jj^  qjjj. 
experiment,  the  separation  between  ^p  and  is  49  nm,  which  results  in  an  FWC  with 

very  small  gain  bandwidth.  In  order  to  solve  this  problem,  we  proposed  a  dispersion 
compensation  method  that  involves  splicing  together  pieces  of  fiber  with  dispersion  of 

opposite  signs  at  various  locations  in  the  HNLF.  This  compensated  fiber  has  equivalent 
^0  closer  to  the  pump  wavelength. 

We  have  performed  theoretical  studies  [Marhic-b]  to  quantitatively  analyze  the 
feasibility  of  this  technique.  We  also  performed  experiments  [Ho-b]  to  confirm  the 
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theor>’.  In  the  experiments,  we  compensated  the  dispersion  of  HNLF  by  splicing  pieces 
of  fibers  of  opposite  sign  dispersion  at  various  locations  in  the  HNLF.  The  three 
composite  fibers  we  constructed  to  demonstrate  this  technique  are  illustrated  in  Figure  2. 
Note  that  we  use  standard  single-mode  fiber  (SMF)  as  the  dispersion-compensating  fiber 
(DCF)  in  our  experiment,  because  it  provides  large  positive  dispersion 
(D=  16.3ps?nm"'km'' )  in  1550  nm  window.  The  pump  is  pulse-modulated  to  achieve 

1 1  W  peak  pow'er.  The  measured  gain  curves  for  all  three  composite  fibers  are  plotted  in 
Figure  3  along  with  theoretical  cur\'es.  The  results  show  that:  ( 1 )  the  experimental  results 
agree  well  with  the  theoiy:  and  (2)  the  proposed  dispersion  compensating  technique 
increases  both  FWC’s  conversion  gain  and  bandwidth. 

HNI J:  40  m 

(a) 

HNI-F:  20  m  seirmen! 

^  1 

...  ^ 

DCF:  4  in 

(b) 

^  ‘'HVI  F-  10  iiv4ym,»nt _ ^ 

DC' I  :  ! ni  s-omnent 

(c) 

Figure  2:  Assembly  of  the  fibers:  (a)  HNLF  only;  (b)  HNLF  +  1  segment  of  DCF  segment;  (c) 
HNLF  +  3  segments  of  DCF. 


Figure  3:  Gain  measurements  of  pulsed  one-pump  FWCs  with  dispersion-compensated  HNLFs. 
Solid  lines  are  theoretical  plots  and  dots  are  experimental  data  points.  The  m's  denote  the 
number  of  DCF  segments. 
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2.3.  Broadband  FWC  and  Its  Interaction  with  Raman  Gain 

Early  work  with  FWCs  demonstrated  relatively  small  bandwidths.  of  the  order  of  1 
nm.  We  have  shown  that  under  certain  circumstances.  FWCs  could  in  principle  exhibit 
bandwidths  as  large  as  several  hundred  nanometers  [Marhic-a].  and  we  have 
experimentally  demonstrated  a  120  nm  bandwidth  FWC  as  part  of  this  project  [Ho-c]. 

We  have  continued  this  broadband  FWC  effort  by  studying  theoretically  and 
experimentally  FWCs  with  a  bandwidth  of  the  order  of  200  nm.  Under  these 
circumstances,  the  FWC  bandwidth  significantly  overlaps  the  pump-induced  Raman  gain 
bandwidth,  which  peaks  at  about  1 10  nm  from  the  pump,  on  the  long  wavelength  side.  It 
then  becomes  necessary  to  study  the  combined  effect  of  the  two  separate  gain 
mechanisms.  We  have  studied  the  theory'  applicable  to  this  regime,  and  also  performed 
experiments  to  verify  the  theoretical  predictions. 

The  gain  medium  for  our  OPA  experiment  was  a  highly  nonlinear  fiber  (HNLF) 

provided  by  Furukawa  Electric.  Table  1  lists  the  HNLF  parameters.  As  can  be  seen,  the 
zero-dispersion  wavelength  K  of  this  fiber  is  much  closer  to  the  available  pump  source. 

Therefore,  an  FWC  with  bandwidth  over  200  nm  is  possible  with  this  HNLF. 


Table  1:  HNLF  parameters. 


Parameter 

Value 

Zero  dispersion  wavelength  Aj 

ID4U.2  nm 

Dispersion  slope  (at  ) 

0.031  ps?nm”'  ?km"‘ 

Nonlinearity  coefficient  / 

18  km-'  ?W-' 

Length  ^ 

ZUTn 

P'^^iaX^r) 

-  5.8^0-^“  s"  ?m-‘ 

We  first  characterized  the  Raman  gain  of  this  HNLF.  and  the  result  is  shown  in 
Figure  4  (a).  We  then  performed  the  pulsed-FWC  experiment,  and  the  results  are  shown 
in  Figure  4  (b).  In  Figure  4  (b).  the  dashed  curve  marked  '‘FWC  only”  shows  the 
hypothetically  pure  FWC  gain  without  Raman  contribution.  It  is  symmetric,  whereas  the 
experimental  data  exhibits  asymmetry,  with  higher  gain  on  the  longer  wavelength  side,  a 
feature  clearly  attributable  to  the  Raman  gain.  The  solid  curve  marked  'TWC-i-Raman”  is 
plotted  with  the  Raman  effect.  The  experimental  data  are  in  good  agreement  with  this 
curve,  indicating  the  validity  of  the  model  we  developed.  Note  also  that  the  two  curves 
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(“FWC  only"  and  "FWC+Raman")  confirm  that  the  pump-induced  Raman  gain 
contribution  is  relatively  small  compared  to  the  FWC  gain. 


In  this  work,  we  have  demonstrated  an  FW^C  with  gain  in  excess  of  1 0  dB  over  a  208 
nm  bandwidth,  which  to  our  knowledge  is  the  largest  to  date  for  an\-  type  of  fiber-based 
wavelength  converter.  We  also  confirmed  that  the  Raman  gain  pro\’ides  onh’  a  relative!)- 
small  perturbation  to  the  broadband  and  high-efficiency  FWC. 
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Figure  4:  (a)  Raman  gain  spectrum.  The  frequency  shift  is  defined  as  the  pump  frequency 
minus  the  signal  frequency,  (b)  Experimental  results.  The  dots  are  experimental  data,  and  the 
solid  and  dashed  lines  are  theoretical  curves  obtained  with  and  without  the  Raman  effect. 


2.4.  Cancellation  of  Idler  Broadening  by  Two-Pump  FWCs 

The  efficiency  and  bandwidth  of  FWCs  strongly  depend  on  the  pump  power.  Pump 
power  levels  of  hundreds  of  milliwatts  are  cufrently  necessary  to  achieve  reasonable 
performance.  Unfortunately,  at  this  high  power  level,  another  nonlinear  phenomenon. 
Stimulated  Brillouin  Scattering  (SBS)  also  occurs.  It  causes  reflection  of  most  pump 
power,  and  thus  reduces  the  pump  power  going  through  the  fiber.  SBS  has  been  widely 
studied,  and  techniques  have  been  proposed  and  demonstrated  to  suppress  it.  Since  SBS 
has  only  a  very'  narrow  bandwidth,  a  common  technique  to  suppress  it  is  to  artificially 
broaden  the  pump  linewidth  by  modulating  the  pump.  This  process  is  called  pump 
dithering.  Either  AM.  FM.  or  PM  can  be  used  to  achieve  this  goal.  At  high  pump  powers, 
pump  broadening  of  several  GHz  may  be  required. 

Although  pump  dithering  suppresses  SBS,  it  creates  another  problem,  illustrated  in 
Figure  5  (a).  If  (pump)  is  shifted  by  ,  then  (idler,  or  converted  signal)  is  shifted 

by  .  As  a  result,  the  pump  frequency  dithering  is  transferred  to  the  idler,  multiplied 
by  a  factor  of  two.  This  leads  to  idler  spectrum  broadening  of  several  GHz  at  high  pump 
power. 
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To  solve  the  idler  linewidth  broadening  problem,  a  technique  was  proposed  in 
[Yang].  The  principle  is  shown  in  Fig.  1(b).  Instead  of  using  a  one-pump  configuration,  a 
two-pump  OPA  [Marhic-c]  is  used.  In  a  two-pump  configuration,  the  idler  broadening 
can  be  cancelled,  when  the  two  pumps  are  adjusted  carefully,  as  illustrated  in  Fig.  1(b). 


(a)  (b) 


Figure  5:  (a)  Idler  broadening  due  to  pump  dithering,  (b)  Idler  broadening  suppression  using 
two-pump  scheme. 

We  have  theoretically  investigated  and  experimentally  implemented  this  technique 
by  two  different  methods  -  FM  modulation  [Ho-d]  and  PM  modulation  [Ho-e].  Figure  6 
shows  part  of  the  results  of  the  cancellation  experiment.  Figure  6  (a)  show's  the  idler 
spectrum  when  cancellation  is  applied.  For  comparison.  Figure  6  (b)  shows  the  idler 
spectrum  when  no  cancellation  is  applied.  The  comparison  shows  that  without 
cancellation,  the  modulation  due  to  pump  dithering  is  lower  by  4  dB  than  the  center  peak, 
and  that  with  cancellation,  the  modulation  is  17  dB  lower  than  the  center  peak.  The 
spread  of  the  sidebands  is  also  reduced  with  cancellation,  from  6  GHz  to  2  GHz.  Further 
results  show  that  the  idler  can  have  almost  the  same  quality  as  the  output  signal.  This 
approach  could  help  make  fiber  FWM-  and  OPA-based  wavelength  converters  practical. 
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Figure  6;  Results:  (a)  idler  spectrum  with  broadening  suppression;  (b)  idler  spectrum  without 
broadening  suppression. 
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This  research  continues  to  examine  technologies  for  low-power,  high-performance,  reliable,  and 
reusable  architectures  for  distributed  adaptive  signal  processing.  The  objective  is  to  demonstrate  a  design 
methodology  which  is  not  only  adaptable  but  also  trainable  with  capability  for  massively  parallel  and 
distributed  processing.  It  will  integrate  distributed  network  techniques  and  provide  a  design 
environment  for  automatic  reconfiguration  and  rapid  system  prototyping. 


I.  Minimum  Energy  Network  Protocol  -  ^ 

A  challenge  in  designing  distributed  networks  is  finding  an  efficient  protocol  for  multiple  access.  The 
first  issue  that  any  such  protocol  must  address  is  the  temporal  and  or  frequency  separation  of  the  transmit 
and  receive  slots  of  any  node.  This  means  either  that  a  node  must  transmit  and  receive  in  different  time 
slots  or  that  it  must  transmit  and  receive  on  widely  separated  frequencies  using  filters  with  sharp  out-of- 
band  rejection. 

This  requirement  presents  a  considerable  challenge  in  a  distributed  network  where  the  aim  is  peer-to- 
peer  communications  between  geographically  distributed  nodes.  For  example,  in  a  communication 
transmission  from  Node  A  to  Node  B,  Ae  system  designer  must  insure  that  the  receiver  Node  B  is  in  the 
receive  mode  independent  of  the  physical  locations  of  Node  A  and  Node  B.  Further,  the  designer  must 
make  this  possible  for  every  pair  of  nodes  within  a  transmission  range  of  each  other. 

We  have  a  designed  a  position-based  multiple  access  protocol  which  leverages  two  important 
technologies  that  have  emerged  recently: 

(1)  Global  Positioning  System,  In  our  related  DARPA-sponsored  research,  we  have  already 
demonstrated  the  viability  of  small,  low-cost,  low-power  GPS  receivers.  A  radio  network 
architecture  for  this  program  uses  the  GPS  position  and  absolute  time  information  for 
both  the  estimation  of  transmit  power  fall-off  with  respect  to  distance  and  multiple  access 
duplexing  and  user  signature  assignments.  The  former  of  these  techniques  was 
demonstrated  in  July  1998  while  the  latter  is  being  demonstrated  in  our  current  research. 

(2)  Multi-User  Detection:  Low  complexity  multi-user  detectors  and  interference  cancelers  are 
recent  advances  in  the  physical  layer  design  of  communication  systems.  In  our  multiple 
access  protocol,  we  make  use  of  these  detectors  at  the  receiver  of  each  node  to  achieve 
high  efficienq^  in  spectrum  usage. 
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Our  position-based  duplexing  strategy  allows  the  designer  to  trade-off  spectral  efficiency  for  multi¬ 
user  communications  with  the  fraction  of  the  whole  deployment  region  with  which  a  node  can 
communicate.  The  idea  here  is  to  divide  up  the  deployment  region  into  cells  which  are  called  duplexing 
cells.  Each  cell  is  assigned  a  different  duplexing  pattern  which  any  node  in  that  cell  must  use  for  its 
transmission  and  reception.  An  example  for  a  duplexing  pattern  is  {T,R,T,T,R,T,...},  where  T  denotes  a 
Transmit  slot  and  R  denotes  a  Receive  slot  for  that  user.  In  the  above  example,  the  user  must  transmit  in 
the  first  slot,  receive  in  the  second,  transmit  in  the  third  and  fourth  slots,  etc. 

We  have  found  an  optimal  duplexing  scheme  such  that  for  a  block  consisting  of  such  cells,  each  node 
in  this  block  of  cells  can  communicate  with  every  other  node  in  the  same  block  in  a  manner  that  makes  the 
best  use  of  the  available  bandwidth. 

In  contrast  to  traditional  multiple  access  protocols  such  as  CSMA,  our  protocol  allows  users  to 
transmit  at  the  same  time  without  collisions.  Since  traditional  protocols  do  not  use  multi-user  detection  at 
the  receivers,  they  cannot  avoid  collisions  from  taking  place.  In  our  protocol,  which  leverages  multi-user 
detection,  simultaneous  reception  from  many  users  is  possible. 

The  important  point  in  this  discussion  is  that  the  assignment  of  duplexing  patterns  and  user 
signatures  occur  in  a  completely  distributed  fashion  since  these  assignments  are  based  on  position.  Each 
node  knows  its  own  position  via  the  GPS,  since  there  is  a  particular  pre-assigned  duplexing  code  and  a 
pre-assigned  family  of  user  signatures  for  the  cell  in  which  the  node  is  located,  the  node  knows  with 
which  pattern  and  which  signature  to  transmit. 

We  are  currently  simulating  the  position-based  multiple  access  protocol  (PBMA)  in  OPNET,  which  is 
a  versatile  network  simulator. 


IL  Likelihood  of  System  States  for  Non-Stationary  Processes 

In  our  last  report,  we  described  an  approach  that  combines  classical  linear  regression  techniques  with 
instantaneous  error  to  define  the  likelihood  that  the  coefficients  of  a  linear  predictor  adequately  capture  a 
system's  state.  Such  a  methocd  could  be  leveraged  for  monitoring  complicated  mechanical  objects  in  real¬ 
time  for  signs  of  critical  wear.  For  this  research  program,  the  end  objective  is  to  deploy  numerous 
intelligent  adaptive  systems  over  a  wide  geographic  area  for  data  gathering,  relying  upon  communication 
systems  to  integrate  local  processing  into  a  distributed  consensus.  This  approach  —  when  compared  to 
conventional  centralized  systems  —  increases  performance,  reduces  latency  and,  most  importantly, 
reduces  the  overall  system  cost.  For  this  approach  to  work,  however,  each  individual  local  processor 
must  be  labeled  with  a  likelihood  measure  of  correct  operation. 

Our  work  in  the  estimation  of  non-stationary  time-series  processes  has  resulted  in  a  novel  probability 
model  that  can  assign  a  likelihood  measure  to  accuracy  of  adaptive  predictors.  This  model  combines 
short-term  error  and  long-term  accumulated  error  to  accommodate  adaptation  as  a  non-stationary  process 
transpires. 

For  a  linear  predictor,  the  instantaneous  error  (e.g..  Normalized  Least  Mean  Squares)  determines  how 
well  (or  how  poorly)  the  current  coefficients  capture  the  state  of  the  system.  One  can  view  error  in  terms 
of  a  probability:  as  error  increases,  the  system  is  likely  to  be  moving  from  one  state  to  another.  If  error  is 
consistently  high,  a  number  of  different  states  are  likely.  If  error  is  low,  the  state  is  specified  with  a  higher 
likelihood.  Thus  the  error  is  directly  proportional  to  predictor  performance.  One  can  build  a  probability 
density  model  based  upon  this  error  to  quantitatively  determine  the  relationship  between  error  and 
performance. 
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In  a  complementary  fashion,  linear  regression  techniques  can  be  used  to  describe  the  long-term 
confidence  in  the  predictor  coefficients.  An  accumulated  sum-of-squares  error  is  the  basis  for  confidence 
intervals  on  a  system's  regressors,  which,  in  this  case,  are  the  predictor  coefficients  themselves.  A  system 
with  large  accrued  error  is  not  being  modeled  well  and  will  result  in  large  confidence  intervals  on  its 
regressors.  Similar  to  the  instantaneous  error  situation,  a  probability  density  model  based  upon 
confidence  interval  size  can  be  built  that  explicitly  depicts  the  relationship  between  error  and 
performance. 

Our  novel  approach  integrates  both  of  the  probability  models  to  form  a  multi-dimensional  joint 
probability  density.  This  model  allows  one  to  determine  the  probability  of  "correct  prediction"  as  a 
function  of  instantaneous  error  and  regressor  confidence  intervals.  Since  confidence  intervals  are  defined 
on  individual  regressors,  or  confidence  hyper-volumes  are  defined  on  the  vector  of  regressors,  the  density 
function  is  truly  multi-variate.  In  the  model,  all  parameters  are  normalized  (between  0  and  1). 

Combining  the  probability  measure  in  this  framework  of  a  "state  space  road-map"  means  that  it  can  be 
used  as  a  instantaneous  confidence  measure  of  route  that  the  system  proceeds  upon  within  that  space. 
Simulation  results  have  provided  preliminary  validation  of  the  mathematical  bases  for  the  models.  Real- 
world  validation  is  being  pursued  by  applying  the  method  to  the  prediction  of  the  vibration  signature  of 
rotor-craft  gear-boxes  (transmissions)  as  *ey  mechanicaUy  fail.  Actual  data  from  US  Navy  drive-train 
test-stands  are  used  to  validate  the  method  and  underlying  assumptions. 

We  believe  the  approach  has  much  broader  application  in  the  general  field  of  state  identification  for 
non-stationary  processes.  For  example,  in  a  communication  system,  the  joint  probability  density  can  be 
used  for  channel  identification.  Where  chartfiels  are  highly  non-stationary,  e.g.,  mobile  telephony,  a 
reliable  estimate  of  the  channel  configuration  can  be  used  to  increase  capacity  and  reduce  power 
consumption  of  both  the  transmitter  and  receiver. 
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1  Scientific  Objectives 

We  apply  techniques  of  information  theory  to  problems  of  information  compression,  network 
information  flow,  investment,  and  quantum  data  transmission.  The  investigations  cover  the 
following  specific  areas:  -  / 

1.  Universal  lossy  compression 

2.  Optimal  signaling  under  the  worst  noise 

3.  Characterizing  and  estimating  rare  events 

4.  Duality  between  channel  coding  and  source  coding  with  side  information 

5.  Transmission  over  channel  with  side  information 

6.  Optimal  investment 

7.  Quantum  multiple-access  channel 

2  Summary  of  Research 

2.1  Universal  Lossy  Compression 

We  address  a  long-standing  open  problem  in  data  compression,  that  of  finding  an  effi¬ 
cient  extension  of  the  celebrated  Lempel-Ziv  algorithm  to  the  case  of  lossy  compression. 
In  [Kontoyiannis  I]  we  present  a  universal  algorithm  for  encoding  memoryless  sources  at  a 
fixed  distortion  level,  which  arises  as  a  generalization  of  the  Lempel-Ziv  lossless  compres¬ 
sion  scheme  to  the  lossy  case.  We  show  that  the  compression  performance  of  the  proposed 
scheme  is  asymptotically  optimal  with  respect  to  bounded  single-letter  distortion  measures, 
and  argue  that  it  is  of  reasonable  encoding  complexity.  The  main  result  is  related  to  the 
results  in  the  papers  [Kontoyiannis  II]  and  [Kontoyiannis  III]. 
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2.2  Signaling  Under  the  Worst  Noise 

In  this  work  [Diggavi  and  Cover]  we  find  the  worst  constrained  noise  processes  for  the  addi¬ 
tive  noise  channel.  If  the  signal  and  noise  covariances  matrices  lie  in  bounded  convex  sets, 
we  show  that  a  random  Gaussian  codebook  and  a  decoding  scheme  based  on  a  Gau.ssiaii 
metric  achieve  the  minimax  rate.  We  demonstrate  the  solution  to  the  game-theoretic  prob¬ 
lem  when  we  have  a  banded  matrix  constraint  (specified  up  to  a  certain  covariance  lag)  on 
the  noise  covariance  matrix  and  show  that  under  certain  conditions  (sufficient  input  power) 
the  maximum  entropy  noise  is  also  the  solution  to  the  minimax  problem. 

2.3  Characterizing  and  Estimating  Rare  Events 

In  many  application  areas  ranging  from  manufacturing,  supply  chains,  communication  net¬ 
works  and  financial  engineering,  intrinsic  variability  causes  various  risk  and  reliability  con¬ 
cerns.  Typically,  an  attempt  is  made  to  mitigate  risk  to  a  certain  degree.  Consequently,  in 
many  cases  variability  caused  violations  become  something  of  a  “rare  event."  In  this  work 
[Zeevi  and  Glynn]  and  [Zeevi]  we  consider  certain  aspects  of  extreme  value  theory  and  the 
role  it  plays  in  the  analysis  of  atypical  system  behavior.  Specifically,  we  discuss  how  extreme 
values  might  be  useful  in  characterizing  these  so-called  rare  events,  while  "extremal  statis¬ 
tics,”  estimators  based  on  extreme  values,  can  be  used  in  estimating  and  predicting  their 
occurrence. 

2.4  Duality  Between  Channel  Coding  and  Source  Coding  With 
Side  Information 

In  [Chiang  and  Cover  I]  and  [Chiang  and  Cover  II]  we  show  that  the  duality  between  channel 
capacity  and  data  compression  is  retained  when  state  information  is  available  to  the  sender, 
to  the  receiver,  to  both.,. or  to  neither.  We  also  present  a  unified  theory  for  eight  special 
cases  of  channel  capacity  and  rate  distortion  with  state  information,  which  extends  existing 
results  to  arbitrary  pairs  of  i.i.d  state  information  {S\,  S-2)  available  at  the  sender  and  at  the 
receiver,  respectively.  We  also  show  that  the  general  formula  of  channel  capacity  assumes 
the  same  form  as  the  Wyner-Ziv  rate  distortion  function  with  state  information. 

2.5  Transmission  Over  Channel  With  Side  Information 

In  [Costa]  Costa  showed  that  the  capacity  of  a  channel  with  additive  white  Gaussian  noise 
and  power  constrained  input  is  not  affected  if  some  extra  i.i.d  noise  sequence  is  added  to 
the  output  of  the  channel,  as  long  as  the  full  knowledge  of  this  extra  noise  sequence  is  given 
to  the  encoder.  In  this  work  [Yu,  et  al\  we  generalize  Costa’s  result  to  the  case  where  the 
additive  noise  is  colored  and  not  necessarily  stationary  or  ergodic.  We  show  that  the  capacity 
for  such  a  channel  is  also  not  affected  by  the  extra  additive  noise  sequence. 

In  related  work  [Chiang,  Sutivong,  and  Cover],  we  study  a  conflict  between  transmitting 
pure  information  and  state  information  over  a  channel  where  the  transmitter  has  a  complete 
knowledge  about  the  state  of  the  channel.  The  novelty  in  characterizing  the  tradeoff  between 
transmitting  pure  information  and  state  information  arises  primarily  because  of  the  inability 
to  encode  and  decode  the  state  information.  The  tradeoff  region  is  typically  difficult  to 
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obtain  even  for  a  simple  channel.  In  [Sutivong.  Chiang,  and  Cover]  we  characterize  the 
optimal  tradeoff  for  the  binary  channel  and  also  prove  the  optimality  of  the  extreme  points 
of  the  tradeoff  region  for  the  additive  Gaussian  channel. 

2.6  Optimal  Investment 

In  [Cover  and  Julian]  we  compare  the  theoretical  and  empirical  performance  of  horizon-free 
universal  portfolio  for  a  large  number  of  stock  pairs  using  real  stock  market  data.  We  con¬ 
sider  two  scenarios:  with  and  without  side  information,  and  with  and  without  short  selling. 
We  also  derive  a  correction  factor  for  the  asymptotic  performance  of  the  universal  portfolio. 
Finally,  we  propose  the  concept  of  a  parallel  artificial  market  and  show  empirically  that  max¬ 
imum  likelihood  with  prepending  provides  a  computationally  efficient  method  to  compute 
the  universal  portfolio. 

In  related  work  [Iyengar  and  Cover],  we  formulate  the  problem  of  growth  optimal  in¬ 
vestment  in  horse  race  markets  with  proportional  costs  and  study  growth  optimal  strategies 
for  both  stochastic  horse  races  as  well  as  races  where  one  does  not  make  any  distributional 
assumptions.  Our  result  extends  all  known  results  on  frictionless  horse  race  markets  to  their 
natural  analog  in  markets  with  costs. 

2.7  Quantum  Multiple- access  Channel 

In  [Klimovitch]  we  analyze  a  quantum  adder  channel,  i.e.  the  extension  of  a  classical  multiple- 
access  additive  binary  channel  to  the  quantum  regime,  in  which  quantum  bits  (qubits)  rather 
than  classical  bits  are  transmitted.  Quantum  entanglement  between  different  trasmitters 
and  the  receiver  results  in  a  significant  expansion  of  the  capacity  region  of  the  channel  in 
comparison  with  its  classical  counterpart.  For  tw’o  uncoordinated  users,  the  maximum  rate 
of  a  single  user  doubles,  while  the  maximum  rate-sum  increases  by  a  factor  larger  than  two. 
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ROBUST  ADAPIVE  FILTERING  ALGORITHMS 


PRINCIPAL  INVESTIGATOR:  T.  Kailath 
GRADUATE  STUDENT:  B.  Haider 

For  the  period  1/98  -  6/98  we  concentrated  on  developing  various  robust  algorithms  for  signal 
processing  and  communication  applications.  This  is  a  continuation  of  the  work  reported  earlier  for 
the  period  5/97  -  12/97.  The  main  focus  of  the  study  was  to  extend  the  mixed  algorithms 

developed  for  the  static  state-feedback  case  as  reported  earlier  to  more  general  settings  in  order  to 
include  the  following  cases: 

•  Fixed  order  dynamic  state- feedback. 

•  Fixed  order  output  feedback. 

This  work  was  completed  by  Ph.D.  student  B.  Haider  who  graduated  in  June  1998.  Some 
publications  on  this  are  under  review. 

1  Mixed  Estimation  and  Control 

The  mixed  H^/H^  approach  to  controller  design  is  an  attempt  to  incorporate  optimal  performance 
and  guaranteed  robustness /arguably  the  two  most  desirable  properties,  into  a  single  controller.  In 
design  a  quadratic  performance  criterion  is  minimized  under  the  assumption  that  an  underlying 
model  of  the  system  and  the  statistical  nature  of  the  exogenous  signals  are  known.  As  a  result, 
the  performance  of  the  optimal  controllers  is  susceptible  to  model  uncertainties.  In  the 
approach  such  assumptions  are  not  made,  and  a  minimax-type  of  criterion  is  adopted;  however,  the 
solutions  may  be  over-conservative  in  terms  of  the  performance.  Fortunately,  given  an  upper 
bound  on  the  performance  there  is  a  multiplicity  of  controllers,  and  in  the  mixed  approach 
we  exploit  this  multiplicity  to  obtain  a  controller  with  the  optimal  performance  subject  to  an 
bound.  To  pose  the  mixed  problem  as  a  contained  optimal  control  problem,  consider 

the  general  2-input  2-output  LTI  system  model  show  in  Figure  1. 


Figure  1:  System  model  for  the  mixed  H^/H^  problem. 
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Mixed  H^/H^  Problem:  Given  the  system  Gmix?  90clI  of  mixed  H^/H^  approach  is  to  find 
a  controller  K,  if  it  exists,  such  that  the  norm  of  the  closed-loop  map  from  input  i/m  to  cj  is 
minimized  subject  to  a  pre-specified  bound  (j)  on  the  norm  of  the  closed-loop  map  from  input 
Woe  ^0  Zoc’ 

As  it  turns  out,  the  mixed  i/^/i^oc  problem,  though  easy  to  motivate,  is  surprisingly  hard  to 
solve  analytically.  Thus,  despite  its  importance  and  considerable  progress  in  robust  control  theory 
the  mixed  problem  by  and  large  remains  an  open  problem.  In  the  absence  of  complete 

analytic  solution  to  the  mixed  H^/H^  problem,  the  need  for  efficient  numerical  algorithms  is 
apparent.  In  the  earlier  work  we  developed  eflRcient  numerical  algorithms  for  computing  the  optimal 
mixed  controller  for  the  static  state-feedback  case.  In  this  report  we  present  algorithm  for  more 
general  setting,  i.e.,  the  dynamic  state- feedback  case  and  output  feedback  case. 

As  a  first  step  towards  the  development  of  such  algorithms,  the  design  of  the  optimal  mixed 
problem  must  be  reformulated  in  a  finite  dimensional  optimization  framework.  The  size  of  the 
resulting  optimization  problem  is  determined  by  the  order  of  the  optimal  mixed  controller.  However, 
a  solution  to  the  optimal  mixed  problem  is  not  guaranteed  to  be  of  bounded  order,  let  alone  of 
a  fixed-order.  It  is  hard  to  formulate  a  finite  dimensional  optimization  problem  to  obtain  such 
variable  order,  possibly  infinite  order,  solutions.  This  suggests  the  need  to  put  some  restriction 
on  the  controller  order.  We  pose  this  restriction  on  the  controller  order  by  considering  only  the 
so-called  fixed-order  solutions  where  the  order  of  the  controller  is  pre-specified.  By  restricting 
the  search  to  controllers  .ofi  fixed-order,  we  obtain  a  meaningful  finite  dimensional  constrained 
optimization  problem. 

The  steps  required  for  the  formulation  of  the  optimization  problem  are  as  follows.  Using  the 
state-space  description,  we  represent  the  set  of  fixed-order  controllers  by  a  set  of  fixed  size  matrices. 
For  convenience,  the  system  matrices  are  stacked  into  a  single  design  variable  K,  Next,  the 
norm  of  the  closed-loop  map  Tz2W2  is  expressed  as  an  explicit  function  of  the  design  variable  K,  and 
the  norm  bound  on  the  closed-loop  map  T^^woo  is  expressed  as  a  bilinear  matrix  inequality 
(BMI)  in  the  design  variable  K  and  a  Riccati  variable  X.  Using  these  expressions,  the  design  of 
fixed-order  mixed  controllers  is  converted  into  a  problem  of  minimization  of  the  cost  function 
subject  to  a  BMI  constraint.  Both  the  cost  function  and  the  constraint  set  expressed  as 
BMI  are  non-convex  in  the  design  parameter  K.  There  is  no  one-step  method  for  solving  this 
non-convex  constrained  minimization  problem  and  we  propose  an  iterative  approach. 

The  key  idea  behind  these  iterative  algorithms  is  to  break  the  difficult  non-convex  problem  into 
a  series  of  subproblems  that  can  be  readily  solved.  Toward  this  goal  we  construct  subproblems 
that  are  convex  and  hence,  easily  solvable.  Convex  subproblems  are  derived  from  the  original 
problem  by  replacing  the  non-convex  cost  function  by  a  quasi-convex  upper  bound  and  the 
non-convex  constraint  set,  expressed  as  a  BMI,  by  a  convex  inner  approximation  with  an  LMI 
description.  Moreover,  we  show  that  these  subproblems  can  be  converted  to  a  special  class  of 
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convex  problems,  known  as  semidefinite  programming  (SDP)  problems,  for  which  there  are  efficient 
algorithms  available. 

To  illustrate  the  performance  improvement  achieved  by  fixed  order  dynamic  mixed  controller 
over  the  conventional  central  controller  we  consider  a  3-ed  order  LTI  system  [3,  1].  The  norm  of 
the  closed-loop  map  Tzjiuj  is  plotted  as  a  function  of  bound  7  and  shown  in  Figure  2.  Figure 
2  also  shows  the  norm  corresponding  to  the  central  and  the  static  mixed  controller.  The  plot 
shows  that  the  dynamic  full-order  state-feedback  controller  yields  lower  norm  compared  to  the 
static  controller,  especially  for  the  smaller  values  of  7  close  to  the  optimal  norm  jopt-  This  is 

a  significant  difference  between  the  mixed  problem  and  the  optimal  or  the  suboptimal 

H°^  problems. 


Figure  2:  Algorithm-IV:  H"^  performance  of  full-order  (ric  =  3)  dynamic  controller  (Example  5). 

Using  similar  ideas  we  also  developed  algorithm  for  both  static  and  fixed-order  dynamic  output 
feedback  cases.  For  more  details  on  these  algorithms  see  [2]. 
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